Deduplication in Yaffs
نویسندگان
چکیده
NAND flash memory has been the single biggest change to drive technology in recent years, with the storage medium showing up in data centers, laptops and in memory cards in mobile devices. It addresses the performance problems in storage. Data Deduplication is simple yet effective technology providing solution to the massive storage requirements of data. Deduplication removes redundant files/blocks of data and ensures that only unique data are stored. We embody the advantages of deduplication and solidstate disks making storage efficient, practical. We have implemented deduplication in YAFFS2, the NAND specific Flash file system on Android OS using chunk index, a compact in memory data structure for identifying new file chunks, which abates data storage limitations. The properties of solid-state devices are harnessed to reduce the complexity of implementation. We implement simple chaining and caching for further optimization. We show that the write time for duplicate data and the storage space has been reduced extensively.
منابع مشابه
Forensic analysis of the android file system YAFFS2
The popularity of Android devices has resulted in a requirement for a process to extract and analyse data in a forensically sound manner. There is a wide range of devices which use the Android operating system, and hence a standard process for forensic extraction and analysis for all devices is not possible. Many devices use the Yet Another Flash File System (YAFFS), which introduces an additio...
متن کاملA Flash File System to Support Fast Mounting for NAND Flash Memory Based Embedded Systems
In embedded systems, NAND flash memory is typically used as a storage medium because of its non-volatility, fast access time and solid-state shock resistance. However, it suffers from out-place-update, limited erase cycles and page based read/write operations. Flash file systems such as JFFS2 and YAFFS, allocate memory spaces using LFS (Log-structured File System) to solve these problems. Becau...
متن کاملOffline Selective Data Deduplication for Primary Storage Systems
Data deduplication is a technology that eliminates redundant data to save storage space. Most previous studies on data deduplication target backup storage, where the deduplication ratio and throughput are important. However, data deduplication on primary storage has recently been receiving attention; in this case, I/O latency should be considered equally with the deduplication ratio. Unfortunat...
متن کاملMetadata Considered Harmful...to Deduplication
Deduplication is widely used to improve space efficiency in storage systems. While much attention has been paid to making the process of deduplication fast and scalable, the effectiveness of deduplication can vary dramatically depending on the data stored. We show that many file formats suffer from a fundamental design property that is incompatible with deduplication: they intersperse metadata ...
متن کاملA Robust Fault-Tolerant and Scalable Cluster-wide Deduplication for Shared-Nothing Storage Systems
Deduplication has been largely employed in distributed storage systems to improve space efficiency. Traditional deduplication research ignores the design specifications of shared-nothing distributed storage systems such as no central metadata bottleneck, scalability, and storage rebalancing. Further, deduplication introduces transactional changes, which are prone to errors in the event of a sys...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2010